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© This invention provides for the construction of a codon optimized DNA sequence encoding the insect- 
specific toxin Androctonus austral is insect toxin (AalT). The codon optimized sequence is then inserted into an 
insect virus. A susceptible insect which ingests such a modified insect virus will cease feeding on plants due to 
a toxin-induced paralysis at an earlier time than an insect which ingests a wild-type insect virus lacking an AalT 
gene. 
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Field of the Invention 

This invention relates to the construction of a codon optimized DNA sequence encoding the insect- 
specific toxin AalT, to the expression of that toxin, and the demonstration of in vivo toxicity of insect viruses 
5 encoding that toxin. 

Background of the Invention 



The following abbreviations are used throughout this application: 



70 


A. cal. - 


Autographa californica 




AcMNPV - 


Autographa californica nuclear polyhedrosis virus 




AalT - 


Anrlrnr"tnni i«? ill iQtrpiliQ in^ppt tnvin 




bp- 


base pairs 




CPU- 


contractile paralysis unit 


15 


ECV- 


extracellular virus 




GV- 


granulosis virus 




kD - 


kilodaltons 




LT- 


lethality time 




MOI - 


multiplicity of infection 


20 


NPV- 


nuclear polyhedrosis virus 




OB - 


occlusion body 




Occ- - 


occlusion negative virus(es) 




Occ + - 


occlusion positive virus(es) 




PCR- 


polymerase chain reaction 


25 


PDV- 


polyhedron derived virus 




PFU - 


plaque forming unit 




p.i. - 


post-infection 




PIB- 


polyhedron inclusion body (also known as OB) 




ST- 


survival time 



30 The family of DNA insect viruses known as Baculoviridae includes nuclear polyhedrosis viruses (NPV) 
and granulosis viruses (GV). These viruses produce occlusion bodies (OBs) in their life cycle. Also included 
are the non-occluded viruses (NOV), which do not produce OBs in their life cycle. Another family of DNA 
insect viruses are the entomopox viruses. 

Over 400 baculovi ruses have been identified as present in invertebrates. Examples of NPVs include 

35 Lymantria dispar NPV (gypsy moth NPV). Autographa californica MNPV, Syngrapha falcifera NPV (celery 
looper NPV), Spodoptera litturalis NPV, Spodoptera frugiperda NPV, Heliothis armigera NPV, Mamestra 
brassicae NPV, Choristoneura fumiferana NPV, Trichoplusia ni NPV, Helicoverpa zea NPV, etc. Examples of 
GVs include Cydia pomonella GV (codling moth GV), Pieris brassicae GV, Trichoplusia ni GV, etc. 
Examples of NOVs are Orcytes rhinoceros NOV and Heliothis zea NOV. Examples of entomopox viruses 

40 include Melolontha melonotha EPV, Amsacta moorei EPV, Locusta migratoria EPV, Melanoplus sanguinipes 
EPV, Schistocerca gregaria EPV, Aedes aegypti EPV, Chironomus luridus EPV, etc. 

The use of baculoviruses and entomopox viruses as bioinsecticides holds great promise. One of the 
major impediments to their widespread use in agriculture is the time lag between initial infection of the 
insect and its death. This lag can range from a few days to several weeks. During this lag, the insect 

45 continues to feed, causing further damage to the plant. A number of researchers have attempted to 
overcome this drawback by inserting a heterologous gene into the viral genome, so as to express an insect 
controlling or modifying substance, such as a toxin (Bibliography entries 1 ,2,3). 

The life cycle of baculoviruses, as exemplified by AcMNPV, includes two stages. Each stage of the life 
cycle is represented by a specific form of the virus: Extracellular viral particles (ECV) which are 

so nonoccluded, and occluded virus particles (OB) (4,5). The extracellular and occluded virus forms have the 
same genome, but exhibit different biological properties. The maturation of each of the two forms of the 
virus is directed by overlapping sets of viral genes, some of which are unique to each form. 

In its naturally occurring insect infectious form, multiple virions are found embedded in a paracrystalline 
protein matrix known as an occlusion body (OB), which is also referred to as a polyhedron inclusion body 

55 (PIB). The proteinaceous viral occlusions are referred to as polyhedra (polyhedron is the singular term). A 
polyhedrin protein, which has a molecular weight of 29 kD, is the major viral-encoded structural protein of 
the viral occlusions (4,6). (Similarly, GVs produce OBs which are composed primarily of granulin, rather 
than polyhedrin). 
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The viral occlusions are an important part of the natural baculovirus life cycle, providing the means for 
horizontal (insect to insect) transmission among susceptible insect species. In the environment, a suscept- 
ible insect (usually in the larval stage) ingests the viral occlusions from a contaminated food source, such as 
a plant. The crystalline occlusions dissociate in the gut of the susceptible insects to release the infectious 
5 viral particles. These polyhedron derived viruses (PDV) invade and replicate in the cells of the midgut tissue 
(4). 

It is believed that virus particles enter the cell by endocytosis or fusion, and the viral DNA is uncoated 
at the nuclear pore or in the nucleus. Viral DNA replication is detected within six hours. By 10-12 hours 
post-infection (p.i.), secondary infection spreads to other insect tissues by the budding of the extracellular 

70 virus (ECV) from the surface of the cell. The ECV form of the virus is responsible for cell to cell spread of 
the virus within an individual infected insect, as well as transmitting infection in cell culture. 

Late in the infection cycle (12 hours p.i.), polyhedrin protein can be detected in infected cells. It is not 
until 18-24 hours p.i. that the polyhedrin protein assembles in the nucleus of the infected cell and virus 
particles become embedded in the proteinaceous occlusions. Viral occlusions accumulate to large numbers 

15 over 4-5 days as cells lyse. These polyhedra have no active role in the spread of infection in the larva. 
ECVs in the haemolymph multiply and spread, leading to the death of the larva (4,5,6). 

When infected larvae die, millions of polyhedra remain in the decomposing tissue, while the ECVs are 
degraded. When other larvae are exposed to the polyhedra, for example, by ingestion of contaminated 
plants or other food material, the cycle is repeated (4). 

20 In summary, the occluded form of the virus is responsible for the initial infection of the insect through 
the gut, as well as the environmental stability of the virus. PDVs are essentially not infectious when 
administered by injection, but are highly infectious orally. The non-occluded form of the virus (i.e., ECV) is 
responsible for secondary and cell to cell infection. ECVs are highly infectious for cells in culture or internal 
insect tissues by injection, but essentially not infectious by oral administration. 

25 The use of recombinant baculoviruses expressing foreign proteins which are toxic to insects is 
facilitated by the fact that these viruses are not pathogenic to vertebrates or plants. In addition, the 
baculoviruses generally have a narrow host range. Many strains are limited to one or a few insect species. 

The Autographa californica nuclear polyhedrosis virus (AcMNPV) is the prototype virus of the family 
Baculoviridae . The AcMNPV virus was originally isolated from Autographa californica (A. cai.), a lepidop- 

30 teran noctuid (which in its adult stage is a nocturnal moth), commonly known as the alfalfa looper. This virus 
infects 12 families and more than 30 species within the order of Lepidoptera insects (7). It is not known to 
infect productively any species outside this order. The most widely studied baculovirus is AcMNPV. This 
virus utilizes many of the protein maturation and transport systems that occur in higher eukaryotic cells. 
In this invention, a gene coding for an insect-specific toxin is inserted into a suitable location in the viral 

35 genome. Heterologous genes inserted into AcMNPV produce proteins which are biologically active in the 
infected insect cells. These proteins, for the most part, undergo post-translational processing to produce 
and secrete recombinant products very similar, if not identical, to those of authentic proteins. A toxin thus 
expressed enhances the bioinsecticidal effect of the virus. 

One such toxin is AalT, which is produced by the venom of the North African scorpion Androctonus 

40 Australis Hector. The toxin is 70 amino acids in length and binds to sodium channels in insects and causes 
contractile paralysis at the nanogram to microgram range in insect larvae. Because AalT does not bind to 
mammalian sodium channels, AalT is a candidate for use as a bioinsecticide to protect crops ingested by 
humans. 

The region upstream of the coding region of the AalT gene includes a signal sequence which directs 
45 the secretion of AalT from the cell. Specifically, the signal sequence directs the toxin through the secretory 
pathway to the cell surface where it is secreted from the cell. During transport, enzymes cleave the signal 
sequence, leaving the mature AalT. 

There is a continuing need for genetically engineered recombinant insect viruses which express 
heterologous toxins in infected hosts. Infection by these recombinant viruses increases the speed of kill 
50 when compared with the wild-type virus. 

Summary of the Invention 

This invention provides for the construction of a codon optimized DNA sequence encoding the insect- 
55 specific toxin AalT. The codon optimized sequence is then inserted into an insect virus such as the 
baculovirus AcMNPV. 

The baculovirus infects specific susceptible insect target species, resulting in the eventual death of the 
insect in its larval stage. The toxin AalT produced by a scorpion is also specific for susceptible insect target 
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species (but not vertebrates) and causes paralysis and ultimately death of the insect. 

The insertion of the gene encoding AalT into a baculovirus results in the expression of the toxin. A 
susceptible insect which ingests such a modified baculovirus will cease feeding on plants due to a toxin- 
induced paralysis at an earlier time than an insect which ingests a wild-type baculovirus lacking an AalT 

5 gene. The greater the reduction in time, the greater the reduction of damage to crops, because the period 
of larval feeding is reduced. Thus, this invention provides a method for protecting plants from damage from 
insects, by delivering to the plant (through spraying or other delivery means) an expression vector 
incorporating this codon optimized nucleic acid sequence for producing AalT. 

The degeneracy of the genetic code permits variations of the nucleotide sequence, while still producing 

to a polypeptide having the identical amino acid sequence as the polypeptide encoded by the native DNA 
sequence. 

The frequency of individual synonymous codons for cognate amino acids varies widely from genome to 
genome among eucaryotes and procaryotes. These differences in codon choice patterns appear to 
contribute to the overall expression levels of individual genes by modulating peptide elongation rates. 

75 The invention involves the design of an AalT gene for use in a baculovirus expression system. It is first 
attempted to have the preferred codon usage frequencies for this synthetic gene reflect the codon usages 
of genes derived from the genome of the cell/organism to be used for recombinant protein expression. 
However, adequate representation for both lepidopteran gene sequences and insect viral gene sequences 
was not available at the time of this invention to create a reliable codon usage table, because there was 

20 DNA sequence information available from only a small number of genes. 

Therefore, codon use tables are taken from a species, Drosophila melanogaster , which has a sufficient 
number (at least 10) of known gene sequences. These codon use tables are used to design the codon 
optimized gene encoding AalT. This methodology also permits the preservation or destruction (if desired) of 
restriction enzyme recognition sites. The codon use frequency for each amino acid of AalT reflects 

25 frequencies in Drosophila melanogaster codon use tables. 

The codon optimized gene encoding AalT of this invention has the sequence of 210 nucleotides set 
forth in SEQ ID NO: 3. This sequence differs from that of the wild-type (native) AalT gene sequence (SEQ 
ID NO: 1) in 58 out of 210 nucleotides (see Figure 1). 

30 Brief Description of the Drawings 

Figure 1 depicts a comparison of the nucleotide sequences of of native and codon optimized genes 
encoding AalT, together with the amino acid sequence of AalT. 

Figure 2 depicts detail of the construction of the plasmid pBS Cuticle-AalT, which contains the 
35 heterologous cuticle signal sequence and a codon optimized cDNA sequence encoding AalT. 

Figure 3 depicts detail of the construction of the plasmid pBS GIII-AalT, which contains the native signal 
sequence and the native cDNA sequence encoding AalT. 

Figure 4 depicts the levels of toxin-encoding RNA as a percentage of polyhedrin RNA which 
accumulates in Sf9 cells infected with wild-type AcMNPV. A DNA probe detects the 3' untranslated region 
40 common to both toxin and polyhedrin RNAs. The bars on the X axis are as follows: Mock (non-infected); 
Wild Type (wild-type AcMNPV); Es6-AalT (AcMNPV containing esterase-6 signal sequence and codon 
optimized AalT gene); ADK-AalT (AcMNPV containing adipokinetic signal sequence and codon optimized 
AalT gene); Cut-AalT (AcMNPV containing cuticle signal sequence and codon optimized AalT gene); pBM- 
AalT (AcMNPV containing pBMHPC-12 signal sequence and codon optimized AalT gene); Chor-AalT 
45 (AcMNPV containing Chorion signal sequence and codon optimized AalT gene); Apo-AalT (AcMNPV 
containing apolipophorin signal sequence and codon optimized AalT gene); Sex-AalT (AcMNPV containing 
sex specific signal sequence and codon optimized AalT gene); and grp lll-AalT (AcMNPV containing native 
signal sequence and native AalT gene). 

Figure 5 depicts graphs of the response time of H. virescens larvae in a viral injection assay. In Panel 
50 A, larvae receive either no virus (negative control)(80 insects) or 100, 1000 or 10,000 PFU of wild-type 
AcMNPV (Strain E2)(48 insects each). In Panel B, larvae receive either no virus (negative control)(80 
insects) or 100, 1000 or 10,000 PFU of AcMNPV which contains the cuticle signal sequence and a codon 
optimized cDNA encoding AalT (48 insects each). 

Figures 6-15 depict graphs of the oral toxicity of viruses, presented as a percentage of larvae 
55 responding at various time intervals to the indicated doses of virus present in a microdrop of ingested diet. 
Figure 6 depicts wild-type AcMNPV (Strain E2), which is used as a control for Figures 7-10. Figure 7 
depicts an esterase-6-AalT-virus construct. Figure 8 depicts an adipokinetic hormone-AalT-vims construct. 
Figure 9 depicts a cuticle-AalT-virus construct. Figure 10 depicts the native signal-native AalT-virus 



4 



EP 0 621 337 A1 



construct. Figure 11 depicts wild-type AcMNPV (Strain E2), which is used as a control for Figures 12-15. 
Figure 12 depicts a pBMHPC-12-AalT-virus construct. Figure 13 depicts a chorion-AalT-virus construct. 
Figure 14 depicts an apolipophorin-AalT-virus construct. Figure 15 depicts a sex specif ic-AalT-virus 
construct. The constructs in Figures 7-9 and 12-15 include a codon optimized AalT gene. 

5 

Detailed Description of the Invention 

The present invention provides for the construction of a codon optimized DNA sequence encoding the 
insect-specific toxin AalT. This isolated nucleic acid sequence is then inserted into an expression vector 
70 which is appropriate for the host cell or organism in which the toxin is to be produced. This toxin has the 
same amino acid sequence as native AalT. The DNA sequence can be inserted directly into the expression 
vector or can be inserted with the aid of a transfer vector. 

The insertion of the gene encoding AalT into an insect virus results in the expression of the toxin, which 
is responsible for a reduction in the time needed for the virus to incapacitate and kill larvae. A susceptible 
15 insect which ingests such a modified insect virus will cease feeding on plants at an earlier time than an 
insect which ingests a wild-type insect virus lacking an AalT gene. The greater the reduction in time, the 
greater the reduction of damage to crops, because the period of larval feeding is reduced. 

The degeneracy of the genetic code permits variations of the nucleotide sequence, while still producing 
a polypeptide having the identical amino acid sequence as the polypeptide encoded by the native DNA 
20 sequence. The procedure known as codon optimization provides one with a means of designing such an 
altered DNA sequence. 

The design of codon optimized genes should take into account a variety of factors, including the 
frequency of codon usage in an organism, nearest neighbor frequencies, RNA stability, the potential for 
secondary structure formation, the route of synthesis and the intended future DNA manipulations of that 
25 gene. 

The degeneracy of the genetic code permits the same amino acid sequence to be encoded and 
translated in many different ways. For example, leucine, serine and arginine are each encoded by six 
different codons, while valine, proline, threonine, alanine and glycine are each encoded by four different 
codons. 

30 However, the frequency of use of such synonymous codons varies from genome to genome among 
eucaryotes and procaryotes. For example, synonymous codon-choice patterns among mammals are very 
similar, while evolutionarily distant organisms such as yeast (S. cerevisiae ), bacteria (such as E. coli) and 
insects (such as D. melanogaster ) reveal a clearly different pattern of genomic codon use frequencies (8- 
13). 

35 These differences in codon-choice patterns appear to contribute to the overall expression levels of 
individual genes by modulating peptide elongation rates. Two lines of experimental evidence support this 
argument. First, the rate of polypeptide synthesis depends on the character of the codons being translated, 
as well as the initial kinetics for transfer RNA ("tRNA") ternary complex formation (13-18). Second, cellular 
transfer RNA distributions tend to follow the cognate codon frequencies of the messenger RNA ("mRNA") 

40 pool (18-21). 

For these reasons, it is desirable and useful to design genes intended for insect virus expression 
systems where the codon frequencies reflect the cognate cellular tRNA frequencies. This is particularly 
desirable for expression systems like the baculovirus expression system using the polyhedrin promoter, 
where a single mRNA specie may utilize over one-half the cellular translation capacity. Hence, codon 
45 frequency really should reflect the cognate cellular tRNA frequencies when expressing recombinant protein 
at high levels. 

The preferred codon usage frequencies for a synthetic gene should reflect the codon usages of nuclear 
genes derived from the exact (or as closely related as possible) genome of the cell/organism that is 
intended to be used for recombinant protein expression. 

so In the present invention, adequate representation for both lepidopteran gene sequences and insect viral 
gene sequences was not available to create a reliable codon usage table for these initial experiments. 

For example, for the design of the AalT gene, there was at the time of this invention DNA sequence 
information from only two specialized proteins: AcMNPV polyhedrin and AcMNPV p10 (22,23). Ikemura et 
al. have cautioned that reliable codon-choice patterns require the summation of codon frequencies from ten 

55 or more genes with varying functions (10,24). This codon optimized sequence is intended to be expressed 
in a variety of insects, not just those infected by AcMNPV. Therefore, it is of primary importance that the 
codon optimization be compatible with the tRNA pool distribution in a variety of insect cells. 
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For this reason, an insect specie is selected which has a sufficient number of known gene sequences. 
Drosophila melanogaster has codon use tables derived from 44 nuclear genes totalling 20,451 codons (11). 
These codon use tables are used to design the codon optimized genes encoding signal sequences and the 
AalT toxin sequence of this invention. 

5 A gene for a full length signal peptide sequence is fused to the codon optimized AalT toxin peptide 
sequence. As discussed in detail below, the signal sequence may be the AalT signal sequence or a native 
or codon optimized nucleotide sequence encoding a heterologous signal sequence. 

These amino acid sequences are reverse-translated into a DNA sequence showing all possible 
nucleotide degeneracies (maximum ambiguity). A list of definite and potential (because of codon degen- 

w eracies) restriction enzyme recognition sites is generated. Restriction enzyme sites critical for gene 
synthesis as well as for the convenience of additional DNA manipulation are selected and preserved. One 
important advantage of assigning codons during the design of optimized genes is that restriction enzyme 
sites can be readily preserved or destroyed at the convenience of one skilled in the art to facilitate gene 
contruction and manipulation. 

15 A list recording the number of times each amino acid occurs in AalT and each signal sequence is 
generated. Codons are assigned on the basis of the number of amino acid residues present in the 
sequence to reflect the relative frequency of Drosophila melanogaster codon use (1 1 ). 

For example, the cysteine codon frequency in Drosophila melanogaster is 76% for the sequence TGC 
and 24% for the sequence TGT. There are eight cysteine residues in the ADK signal-AalT gene construct 

20 (all in the gene coding for the mature AalT protein (SEQ ID NO: 3)). Hence, six codons are allocated as 
TGC (75% of the residues) and two codons as TGT (25% of the residues). Based on this distribution, each 
cysteine codon is then assigned an exact, unambiguous sequence. Attention is given to alternating adjacent 
isocodons. For example, the two adjacent cysteine residues in the AalT peptide (amino acids 37 and 38) 
are assigned the sequence TGT TGC to avoid assigning two identical codon sequences sequentially. 

25 This process is repeated for all twenty amino acids and the termination codon (although the termination 
codon sequence TAA is not changed by this procedure). In addition, all critical restriction enzyme sites are 
preserved and unwanted sites are destroyed. 

The codon optimized nucleic acid sequence of this invention is synthesized and assembled using 
conventional techniques. Example 2 below describes one such method using a series of synthetic 

30 oligonucleotide fragments which are then joined to form the complete coding sequence. 

The complete codon optimized sequence of this invention (SEQ ID NO: 3) differs from the native cDNA 
sequence encoding AalT (SEQ ID NO: 1) by 58 out of 210 nucleotides (see Figure 1). However, the AalT 
produced by the native cDNA and the codon optimized cDNA have identical amino acid sequences. If 
desired, the signal sequence for AalT may also be codon optimized by the same procedures just described. 

35 Once the synthetic gene containing the codon optimized nucleic acid sequence is constructed, the 
gene is inserted into an appropriate expression vector, by conventional techniques such as cloning into the 
vector after digestion with one or more appropriate restriction enzymes. Smith and Summers United States 
Patent 4,745,051 (6) describes the construction of baculovirus expression vectors, which are recombinant 
insect viruses in which the coding sequence for a foreign gene is inserted behind a baculovirus promoter in 

40 place of the viral polyhedrin gene. The polyhedrin gene is nonessential for productive viral infection 
between cells. 

Transfer vectors are used as tools to transfer foreign genes into a viral genome. Transfer vectors 
generally are bacterial ptasmids containing sufficient viral sequences to facilitate insertion of the foreign 
gene into the viral genome by homologous recombination. 

45 Methods for constructing recombinant baculoviruses in cultured cells are set forth in M. D. Summers 
and G. E. Smith, A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures, 
Texas Agricultural Experimental Station Bull. No. 1555 (1987) (25). A preferred cell line is that of Sf9 cells 
(ATCC accession number CRL1711), which are derivatives of the cell line designated Spodoptera frugiperda 
21 (Sf21). Other insect ceil lines that are adequate for propagation of a desirable insect virus include those 

so derived from Trichoplusia ni (TN368), the silkworm Bombyx mori (BM) and Helicoverpa zea (BCIRL-Hz- 
Am1, BCIRL-Hz-Am3). 

Suitable insect viruses include those listed above in the Background of the Invention. A preferred insect 
virus is the baculovirus AcMNPV. A particular strain of AcMNPV designated E2 is used in the Examples. 
Those of skill in the art will recognize that other baculovirus strains may also be used. These include 
55 Trichoplusia ni MNPV, Rachiplusia ou MNPV, Galleria mallonella MNPV, Spodoptera frugiperda NPV and 
plaque-purified strains such as the M3, R9, S1 and S3 strains of AcMNPV isolated and characterized in 
Smith, G.E., and Summers, M.D., J. Virol. , 33, 311-319 (1980) (26), as well as Bombyx mori NPV. See also 
Smith, G.E., and Summers, M.D., Virol. , 89, 517-527 (1978) (27). 
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As described above, the expression of AalT by an insect virus reduces the time needed to incapacitate 
a larva. In turn, the maturation and secretion of functional toxin is facilitated by a signal peptide. 

Signal sequences are required for a complex series of post-translation al processing steps which result 
in secretion of a protein. If an intact signal sequence is present, the protein being expressed enters the 

5 lumen of the rough endoplasmic reticulum and is then transported through the Golgi apparatus to secretory 
vesicles and is finally transported out of the cell. Generally, the signal sequence immediately follows the 
initiation codon and encodes a signal peptide at the amino-terminal end of the protein to be secreted. In 
most cases, the signal sequence is cleaved off by a specific protease, called a signal peptidase. Preferred 
signal sequences improve the processing and export efficiency of recombinant protein expression using 

70 viral expression vectors. Optimized expression of the toxin using an appropriate heterologous signal 
sequence achieves more rapid lethality than wild-type insect virus. 

Summers United States Patent 5,155,037 (28) describes the use of insect signal sequences such as 
adipokinetic hormone and cuticle in a Lepidopteran insect cell transformed or transfected with a DNA 
vector, where said signal directs the secretion of a heterologous protein from the insect cell. The only 

75 foreign proteins disclosed are CD4, interleukin-2 and beta-interferon. All of these proteins are mammalian 
proteins; none is an insect toxin. Prior to United States Patent 5,155,037, the only method of achieving 
secretion of a foreign protein in insect cells is by using the foreign gene's native signal peptide. Because 
the foreign genes are usually from non-insects, their signals may be poorly recognized by insect cells, 
resulting in suboptimal levels of expression. The signal sequence is generally coded by a DNA sequence 

20 immediately following (5 f to 3 f ) the translation start site of the foreign gene. Proteins secreted from cells are 
synthesized as precursor molecules containing hydrophobic N-terminal signal peptides. 

A secretory signal sequence contains three regions: A basic N-terminal region (n-region); a central 
hydrophobic region (h-region); and a more polar C-terminal region (c-region) (29). Cleavage of the signal 
sequence from the mature protein once export is under way appears to be dependent upon structures in 

25 the n- and h- regions; however, positions -3 and -1 relative to the cleavage site (in the c- region) have been 
suggested to be the most important (29). The residue in position -1 should be small (alanine, serine, 
glycine, cysteine, threonine or glutamine), while the residue in position -3 should not be aromatic 
(phenylalanine, histidine, tyrosine or tryptophan), charged (asparagine, glutamic acid, lysine or arginine) or 
large and polar (asparagine or glutamine), with proline being absent from the region -3 to +1. Some 

30 exceptions to these parameters are permitted (29). 

This -3, -1 rule is 75-80% predictive of the cleavage site in mammalian systems (29), but has not been 
confirmed in insect systems. 

Summers United States Patent 5,155,037 (28) describes the sequence encoding the Lepidopteran 
( Manduca sexta) adipokinetic hormone signal peptide. This is a short signal peptide (19 amino acids) for a 

35 blocked neuropeptide that regulates energy substrate mobilization and metabolism in insects. Figure 2B of 
the patent depicts the nucleotide and amino acid sequences of the signal peptide. 

The patent also describes the sequence encoding Drosophila melanogaster cuticle signal peptide. 
Figure 2A of the patent depicts the nucleotide and amino acid sequences of the cuticle gene CP1 leader 
peptide. The nucleotide sequence codes for a 16 amino acid signal peptide. 

40 In an effort to identify heterologous signal sequences useful with the codon optimized AalT gene, eight 
constructs with heterologous signal sequences are prepared by Applicants. The DNA sequences encoding 
these signal sequences can be either the native DNA sequences encoding the signal sequences or the 
codon optimized DNA sequences encoding those signal sequences. 

These signal sequences are then combined with the codon optimized AalT gene described previously 

45 in Bluescript plasmids (Stratagene, LaJolla, CA). Each such gene is inserted into a baculovirus transfer 
vector and then moved into the AcMNPV genome by homologous DNA recombination in cultured cells (25). 
The heterologous signal sequences are as follows: lnterleukin-2, Esterase-6, Adipokinetic hormone, Cuticle, 
pBMHPC-12, Chorion, Apolipophorin, and Sex Specific. 

lnterleukin-2 is a human signal sequence (for a T-cell growth factor) whose use with AalT has previously 

so been described (30). The adipokinetic hormone signal sequence (for a neuropeptide) is from Manduca sexta 
and fits the -3, -1 rule (29) well. The chorion signal sequence from Bombyx mori (for an eggshell structural 
protein) and the cuticle signal sequence of Drosophila melanogaster (for an exoskeletal protein) both 
secrete a large quantity of their associated mature proteins. The apolipophorin signal sequence from 
Manduca sexta (for a plasma lipid transport protein), the sex specific signal sequence of Bombyx mori (for a 

55 major adult plasma storage protein) and the pBMHPC-12 signal sequence from Bombyx mori (for a 30 kD 
larval plasma lipoprotein) all are selected because they are found in fat body cells and and each secrete a 
hemolymph protein. Finally, the esterase-6 signal sequence is used for the secretion of a serine hydrolase 
from Drosophila melanogaster . Each of these signal sequences except apoliphophorin complies with the -3, 
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-1 rule (29) for predicting the cleavage site. 

The cDNA encoding each signal sequence can have either its native sequence or a codon optimized 
sequence generated as described above. Specific codon optimized signal sequences are constructed as 
described in Example 2 below. 

5 The codon optimized signal sequences differ from the native sequences as follows: 

pBMHPC-12 - 7 of 48 nucleotides (compare SEQ ID NO: 5 (codon optimized) with SEQ ID NO: 7 (native)); 
adipokinetic hormone - 6 of 57 nucleotides (compare SEQ ID NO: 9 (codon optimized) with SEQ ID NO: 11 
(native)); apolipophorin - 14 of 69 nucleotides (compare SEQ ID NO: 13 (codon optimized) with SEQ ID NO: 
15 (native)); chorion - 10 of 63 nucleotides (compare SEQ ID NO: 17 (codon optimized) with SEQ ID NO: 19 

70 (native)); cuticle - 6 of 48 nucleotides (compare SEQ ID NO: 21 (codon optimized) with SEQ ID NO: 23 
(native)); esterase-6 - 15 of 63 nucleotides (compare SEQ ID NO: 25 (codon optimized) with SEQ ID NO: 27 
(native)); sex-specific - 15 of 45 nucleotides (compare SEQ ID NO: 29 (codon optimized) with SEQ ID NO: 
31 (native)). However, the amino acids of the signal sequences encoded by the differing nucleic acid 
sequences are identical for both the codon optimized and native sequences. 

75 An example of a codon optimized DNA sequence encoding a heterologous cuticle signal sequence 
joined to a codon optimized DNA sequence encoding AalT and inserted into a baculovirus transfer vector is 
the transfer vector designated pAC0055.1 (see Examples 1, 2 and 4 beiow). Samples of an E. coli strain 
HB101 harboring this transfer vector pAC0055.1 were deposited by applicants on December 17, 1992 with 
the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland 20852, U.S.A., and have 

20 been assigned ATCC accession number 69166. Using this deposited material, one of ordinary skill in the art 
can substitute a different signal sequence for the cuticle sequence contained in that plasmid. 

Specifically, the first step is to construct a synthetic double stranded DNA fragment which has the 
following features: (i) a 5' terminal Bam Hl-compatible cohesive end; (ii) a sequence encoding the ATG start 
codon and amino acid sequence of the new signal peptide; and (iii) a sequence encompassing the first 19 

25 nucleotides in the top strand of the codon optimized AalT gene and the first 23 nucleotides in the bottom 
strand. The 3' terminus of such a fragment contains a Sal l-compatible cohesive end. Substitution of this 
fragment for the corresponding Bam Hl/Sal I fragment of pAC0055.1 is achieved by joining the following 
three fragments in the presence of-DNA ligase: (1) the Bam Hl/Sal I synthetic fragment described above; (2) 
a Sal l/Kpn I fragment extending from the Sal I site at codons 6-7 in the codon optimized AalT toxin gene to 

30 the Kpn I site in the 3* flanking polyhedrin gene sequence; and (3) the largest Kpn I/Bam HI fragment 
contained in pAC0055.1. This fragment, which is derived from pVL985 (31), contains the pUC8 cloning 
vector and AcMNPV sequences which flank the 5, and 3' termini of the polyhedrin gene. Those skilled in 
the art will recognize that the resulting construct differs from pAC0055.1 only in the nature of the signal 
peptide. 

35 Once recombinant viruses are prepared containing the codon optimized gene (see discussion above 
and Examples 1, 2, 4 and 5 below), they are tested to confirm the presence of the codon optimized AalT 
gene. Because the codon optimized gene replaces the baculovirus polyhedrin gene, the recombinant 
viruses which carry the toxin gene are unable to form polyhedra and thus produce occlusion-negative 
plaques. Plaque purification is used to isolate recombinant viruses containing the AalT gene from the 

40 transfection supernatant by the method of Summers and Smith (25) (see Example 5). 

Next, virus infected cells are assayed by dot blot hybridization for nucleic acids that hybridize to a 
radiolabeled DNA probe specific for the AalT gene. Radioactivity is detected by autoradiography; a positive 
result confirms the presence of the AalT gene in a virus isolate (see Example 6). 

Methods similar to those described in Examples 1 , 2, 4 and 5 below are used to construct a baculovirus 

45 transfer vector containing the native AalT gene and the native nucleic acid sequence encoding the native 
AalT signal sequence, and then to prepare recombinant virus therefrom (see Examples 3-5 below). 
Polymerase chain reaction is used to verify that the recombinant viruses contain an insert of the correct size 
containing the AalT gene (see Example 7 below). The recombinant viruses containing the native AalT gene 
are used as positive controls in assays for the biological activity of recombinant viruses containing the 

50 codon optimized AalT gene with heterologous signal sequences. 

Expression of the AalT gene in insect cells infected with either the codon optimized or native AalT- 
viruses is estimated by Northern blot analysis of total cellular RNA isolated from virus-infected cells (see 
Example 8 below). Analysis shows high levels of toxin-encoding RNA 24 hours post-infection (see Example 
8). 

55 As mentioned above, the baculoviruses infect a wide range of insects, but are not harmful to mammals. 
All species of insects examined are susceptible to paralysis caused by AalT; however, mammals are not 
adversely affected by the toxin. 
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The comparative biological activity of a wild-type baculovirus, a virus containing the native AalT gene, 
and a virus containing the codon optimized AalT gene (which may also contain a heterologous signal 
sequence) is assayed by two types of in vivo tests with insect larvae: an injection assay and a feeding 
assay. 

In a preliminary assay, the culture medium and cell pellets (resuspended in buffer solution) obtained 
during the preparation of codon optimized AalT-viral stocks are injected into Musca domestica (common 
house fly) larvae to determine whether those samples contain a detectable amount of biologically active 
AalT. 

Larvae are examined five seconds after injection for evidence of involuntary contraction of body 
segments. Results from the assay (see Table in Example 9 below) show that biologically active AalT is 
detected in the cultured cells infected with 13 of 14 codon optimized AalT-virus isolates with heterologous 
signal sequences (esterase-6, adipokinetic hormone and cuticle). The negative results are expected with the 
IL2-AalT-virus constructs due to the presence of a frame shift mutation in the toxin gene coding region. The 
lack of detectable AalT activity in any of the cell culture supernatants indicates that the level of biologically 
active secreted toxin is less than about 0.2 y.g/ml. 

A dose-response injection assay demonstrates that the insertion of the codon optimized AalT gene 
enhances the performance of a baculovirus by reducing the time needed to kill a target insect species. As 
shown in Example 10 below and Figure 5, budded virus prepared from the codon optimized Cuticle-AalT- 
AcMNPV construct and from the wild-type E2 strain of AcMNPV without the AalT gene are injected into 
separate groups of mid-fourth instar larvae at doses of 10 2 , 10 3 and 10 4 PFU per larva. 

At each of the doses tested, the Cuticle-AalT-AcMNPV kills its host faster than the wild-type AcMNPV. 
At 10 4 PFU, the LT 50 for the wild-type virus is approximately 97 hours, whereas the corresponding LT 5 o for 
the Cuticle-AalT AcMNPV is 67 hours. This result shows that insertion of the AalT gene and the cuticle 
signal sequence into AcMNPV accelerates the speed of kill through the expression of biologically active 
toxin. 

The injection assay protocol is repeated using older larvae (early fifth instar) with seven different codon 
optimized heterologous signaJ-AalT-AcMNPV constructs, a native AalT signal-native AalT gene-AcMNPV 
construct and a wild-type E2 strain of AcMNPV without the AalT gene. A dosage of 10 4 PFU of budded 
virus of each construct is injected into separate groups of larvae. 

As described in Example 11 below, each of the codon optimized heterologous signal-AalT-AcMNPV 
constructs (as well as the native AalT signal-native AalT gene-AcMNPV construct) kills its host faster than 
the wild-type AcMNPV. The LT 50 for the wild-type virus is approximately 126 hours, whereas the 
corresponding LT 50 for the seven heterologous signal-AalT-AcMNPV constructs range from approximately 
68-89 hours and the native AalT-AcMNPV construct has an LT 50 of approximately 74 hours. These results 
confirm the results of Example 10 that insertion of the AalT gene and a heterologous signal sequence into 
AcMNPV accelerates the speed of kill through the expression of biologically active toxin. 

In order to test the oral toxicity of these recombinant viruses, which are defective for the production of 
polyhedrin, polyhedra containing a mixture of wild-type and recombinant virions are prepared by co- 
infecting cells in culture with the recombinant AalT-virus and a wild-type helper virus (such as the E2 strain 
of AcMNPV). This approach is made possible because one polyhedron (occlusion body) contains several 
hundred virions. 

Accordingly, host cells are co-infected with various amounts of the recombinant AalT-virus, as well as a 
wild-type virus. The resulting infected cells have polyhedra containing a mixture of wild-type and recom- 
binant virions. As seen in Example 12 below, when the MOI of the wild-type virus is at least 2 PFU per cell, 
as the MOI of the recombinant virus increases, the percent of cells with viral occlusions and the average 
number of polyhedra per cell decrease significantly. Therefore, further co-occlusion studies are conducted 
with the occlusion- negative recombinant virus and the wild-type virus, each at an MOI of 3. This allows 
effective representation of the recombinant virions in the polyhedra at an acceptable level of polyhedron 
production. 

Finally, an oral toxicity assay is conducted by feeding larvae a microdrop of insect diet containing a 
desired amount of PIBs. With the exception of the wild-type virus, each PIB contains a mixture of wild-type 
and recombinant virions. The recombinant virions may encode either the native AalT gene (and native 
signal sequence), or the codon optimized gene linked to a heterologous signal sequence. 

Larvae are then monitored for paralysis and death. The results of the assay are set forth in Example 13 
and Figures 6-15. 

All but one of the recombinant viruses demonstrate an earlier onset of morbidity than the wild-type 
AcMNPV (which lacks the AalT gene). The exception is the AcMNPV construct containing the pBMHPC-12 
signal sequence linked to the codon optimized sequence encoding AalT, which performs approximately the 
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same as the wild-type AcMNPV. The recombinant viruses exhibit a characteristic paralytic response which 
is easily distinguished from the pathogenesis caused by the wild-type virus. The actual ratio of wild-type to 
recombinant virus in the polyhedra and in the infected insects is not analyzed. 

In order that this invention may be better understood, the following examples are set forth. The 
5 examples are for the purpose of illustration only and are not to be construed as limiting the scope of the 
invention. 

Examples 

70 Standard molecular biology techniques are utilized according to the protocols described in Sambrook et 
al. (32). 

Example 1 

75 Determination of a Codon Optimized cDNA Sequence Encoding AalT 

The design of a codon optimized gene for AalT begins with an attempt to reflect the codon usages of 
nuclear genes derived from the exact (or as closely related as possible) genome of the cell/organism that is 
intended to be used for recombinant protein expression. Here, however, adequate representation for both 
20 lepidopteran and baculoviral gene sequences was not available to create a reliable codon usage table for 
these initial experiments. 

Ikemura et al have cautioned that reliable codon-choice patterns require the summation of codon 
frequencies from ten or more genes with varying functions (10,24). However, at the time of this invention, 
there was DNA sequence information from only two specialized proteins: AcMNPV polyhedrin and AcMNPV 
25 p10 (22,23). 

For this reason, Drosophila melanogaster is selected because its codon use tables are derived from 44 
nuclear genes totalling 20,451 codons (11). These codon use tables are used to design the codon optimized 
genes encoding signal sequences and the AalT toxin sequence of this invention. 

Each full length signal peptide sequence is fused to the AalT toxin peptide sequence. These amino acid 
30 sequences are reverse-translated into a DNA sequence showing all possible nucleotide degeneracies 
(maximum ambiguity) using IntelliGenetics Suite™ software (version 1988; Palo Alto, CA). IntelliGenetics 
Suite™ software is used for all computer-assisted nucleic acid sequence analysis. 

A list of definite and potential (because of codon degeneracies) restriction enzyme recognition sites is 
generated. Restriction enzyme sites critical for gene synthesis as well as for the convenience of additional 
35 DNA manipulation are selected and preserved. For example, Bam HI and Ava I restriction enzyme sites are 
required for gene synthesis using Bluescript SK/KS vectors (Stratagene, LaJolla, CA). Sal I and Apa I 
restriction sites are desirable to permit flexibility for additional gene manipulation. 

Using IntelliGenetics Suite™ software, a list recording the number of times each amino acid occurs in 
AalT and each signal sequence is generated. Codons are assigned on the basis of the number of amino 
40 acid residues present in the sequence to reflect the relative frequency of Drosophila melanogaster codon 
use (11). 

For example, the cysteine codon frequency in Drosophila melanogaster is 76% for the sequence TGC 
and 24% for the sequence TGT. There are eight cysteine residues in the ADK signal-AalT gene construct 
(all in the gene coding for the mature AalT protein (SEQ ID NO: 3)). Hence, six codons are allocated as 
45 TGC (75% of the residues) and two codons as TGT (25% of the residues). Based on this distribution, each 
cysteine codon is then assigned an exact, unambiguous sequence. Attention is given to alternating adjacent 
isocodons. For example, the two adjacent cysteine residues in the AalT peptide (amino acids 37 and 38) 
are assigned the sequence TGT TGC to avoid assigning two identical codon sequnces sequentially. 

This process is repeated for all twenty amino acids and the termination codon (although the termination 
so codon sequence TAA is not changed by this procedure). In addition, all critical restriction enzyme sites are 
preserved and unwanted sites are destroyed. 

Flanking DNA sequences for the plasmid vectors as well as the transfer vectors are fused to the 
completed gene sequences to confirm that there are no undesirable restriction enzyme sites at the splice- 
junction boundaries. 

55 
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Example 2 

Construction of Gene Cassettes Containing an Heterologous Signal Sequence Plus the Codon Optimized 
cDNA Sequence Encoding AalT 

5 

Eight heterologous signal sequence-codon optimized AalT toxin gene cassettes are synthesized and 
assembled in two pieces: A "B" fragment, unique for each construct, consisting of DNA coding for one of 
eight heterologous signal sequences, plus the amino terminal portion of the toxin coding region (common to 
all constructs), and an "A" fragment, which is the same for each construct and encodes the remainder of 

70 the toxin coding region. 

Each of fragments A and B is made by annealing a pair of oligomers containing a 15 bp overlap 
purchased from New England Biolabs (Beverly, MA). Sequenase™ 2.0, a DNA polymerase (United States 
Biochemical Corporation, Cleveland, OH), is used to complete the double stranded molecule, which 
contains the heterologous signal sequence plus the codon optimized cDNA sequence encoding AalT. The 

is following is a list of the oligomers used in the construction of these eight constructs: 



Fragment A 

20 

El 5' AGCCCCCGAG TOCCTOCTCT CGAACTATTG 

CAACAATGAA TGCACCAAGG TGCACTACGC 
TGACAAGGGC TACTGTTGCC TTCTGTCCTG 
25 CTATTGCTTC 3' (SEQ ID NO: 33) 



E2 5' CTGTAGGTAC CGGATCCTTA GTTAATGATG 

30 GTGGTGTCAC AGTAGCTCTT GCGAGTATCA 

GAGATTTCCA GAACTTTCTT GTCGTCGTTG 
AGACCGAAGC AATAGCAGGA 3 f 
(SEQ ID NO: 34) 



Fragments Bl-8 
40 Common 

5' AGGCACTCGG GGGCTTTTCC GGATGAGGTC 
GACTGCGTAG CCGTTCTTCT T 3' 

45 



50 



55 
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IL-2 

5' CCCCCCGGAT CCATGTACCG CATGCAGCTG 
CTCTCCTGCA TCGCCCTGTC GCTGGCTCTG 
GTGACCAATA GCAAGAAGAA CGGCTAC 3' 
(SEQ ID NO: 36) 

ADK 

5' CCCCCCGGAT CCATGTACAA ACTGACCGTC 
TTCCTGATGT TCATCGCCTT CGTGATTATC 
GCTGAGGCCA AGAAGAACGG CTAC 3' 
(SEQ ID NO: 37) 

Chorion 

5' CCCCCCGGAT CCATGTTCAC CTTCGCTATT 
CTGCTCCTGT GCGTGCAAGG CTGCCTGATC 
CAGAATGTTT ACGGAAAGAA GAACGGCTAC 
(SEQ ID NO: 38) 

Esteraee-6 

5' CCCCCCGGAT CCATGAACTA CGTCGGGCTG 
GGCCTCATCA TTGTGCTGTC GTGCTTGTGG 
CTGGGGAGCA ATGCTAAGAA GAACGGCTAC 
(SEQ ID NO: 39) 

Apol ipophor in 

5' CCCCCCGGAT CCATGGCCGC TAAATTCGTC 
GTGGTTCTGG CCGCTTGCGT CGCCCTGAGC 
CACTCGGCTA TGGTGCGCCG CAAGAAGAAC 
GGCTAC 3' (SEQ ID NO: 40) 

Sex- Specif ic 

5' CCCCCCGGAT CCATGCGCGT CCTGGTGCTG 
TTGGCCTGCC TGGCAGCCGC TAGCGCTAAG 
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AAGAACGGCT 



AC 3' 



(SEQ ID NO: 41) 



Cuticle 



5 



GTTCGTGATG 



CCCCCCGGAT 



CCATGTTCAA 



ATCTGCGCCG 



TCCTCGGCCT 



GGCTGTGGCC 



AAGAAGAACG 



GCTAC 3' 



(SEQ ID NO: 



42) 



70 



pBMHPC-12 

5' CCCCCCGGAT CCATGAAACT CCTGGTCGTG 
TTCGCCATGT GCGTGCCCGC TGCCAGCGCT 

70 

AAGAAGAACG GCTAC 3' (SEQ ID NO: 43) 

Figure 2 depicts the construction strategy for the AalT gene cassette which contains the Drosophila 
20 melanogaster cuticle signal sequence. The remaining seven heterologous signal sequence-AalT gene 
cassettes are constructed in the same way using the specific oligo for the signal sequence as indicated 
above. 

In the first step, the signal sequence-specific oligo for the B fragment and the common B oligo are 
annealed and the single stranded regions are filled in using Sequenase™ 2.0. Each signal sequence- 

25 specific oligo consists of an identical 12 nucleotide upstream noncoding region containing the Bam HI site 
required for cloning, the coding region for the signal sequence, and the first 15 nucleotides encoding the 
first five amino-terminal amino acids of AalT. This short piece of double stranded DNA is digested with Bam 
HI and Ava I and subcloned into pBluescript SK+ (Stratagene, LaJolla, CA) which had been digested with 
Xma I and Bam HI. Positive subclones are verified by the presence of a Pvu II fragment larger than the 445 

30 bp fragment present in pBluescript SK+ alone. There is a single base pair mismatch between the 
overhangs for the Ava I and Xma I sites. Clones which have corrected this mismatch to regenerate the Aval 
site and lose the Xma I site are selected by the appropriate restriction enzyme digestions. In Figure 2, the 
resulting plasmid is designated pBS Cuticle B7. 

To construct the A fragment encoding the bulk of the AalT coding region, oligos E1 and E2 are 

35 annealed and Sequenase™ 2.0 is used to fill in the single stranded regions. This results in a double 
stranded molecule which contains an Ava I site at the 5' terminus and nested Bam HI and Asp 718 sites 
following the 3* end of the toxin coding region. The Bam Hi and Asp 718 sites are introduced in order to 
facilitate cloning. Fragment A is then digested with Ava I and Asp 718. The fragment B-containing plasmid, 
pBS Cuticle B7, is also digested with Ava I and Asp 718 and Fragment A is subcloned into the digested 

40 plasmid. In the case of Figure 2, the resulting plasmid is designated pBS Cuticle-AalT. Fragments A and B 
are joined at a unique Ava I site which is engineered into the AalT sequence just for this purpose. After the 
ligation of fragment A into the fragment B-containing plasmid, the ligated DNA is used to transform 
competent DHa E. colL Minipreps of plasmid DNA are prepared from the resulting bacterial colonies. 
Restriction enzyme analysis is used to determine which colonies contain the desired recombinant DNA. 

45 Further restriction enzyme analysis, followed by DNA sequencing, is used to confirm the integrity of the 
eight plasmids containing the codon optimized cDNA encoding AalT. 



so Construction of Gene Cassette Containing the Native cDNA Sequence Encoding AalT 

A gene cassette containing the native cDNA sequence encoding AalT is constructed in two pieces, but 
in a manner different from the construction of the codon optimized genes of Example 2. Instead of using 
long oligos which have only a short region of overlap to construct the gene, sets of oligos which completely 
55 overlap are used. A second difference in the manner of construction is necessitated because the native 
DNA sequences for both the AalT signal sequence and coding region are used. No artificial restriction 
enzyme sites are engineered into the DNA sequence. Therefore, a different method of joining the signal 
sequence and coding region of the AalT gene is required. In this method, asymmetrically cutting restriction 
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enzymes are used to allow functionally separate regions of the gene to be neatly fused to each other. This 
allows one to mix and match premade regions in any future constructions. The sequences of the oligomers 
numbered 1-10 used for the construction of the native AalT gene are provided in the list below: 



Oligo 1 

GATCCGATGA AATTTCTCCT ATTGTTTCTC 
GTAGTCCTTC CAATAATGGG GGTGCTTGGC 
(SEQ ID NO: 44) 



Oligo 2 

GCCAAGCACC CCCATTATTG GAAGGACTAC 
GAGAAACAAT AGGAGAAATT TCATCG 
(SEQ ID NO: 45) 
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Oligo 3 



Oligo 4 



Oligo 5 



Oligo 6 



Oligo 7 



Oligo fl 



Oligo 9 



AAGAAGAATG GATATGCCGT CGATAGTAGT 
GGTAAAGCTC (SEQ ID NO: 46) 



CTCAAAAGAC ATTCAGGAGC TTTACCACTA 
CTATCGACGG CATATCCATT CTTCTT 
(SEQ ID NO: 47) 



CTGAATGTCT TTTGAGCAAT TACTGTAACA 
ACGAATGCAC AAAAGTACAT TATGCT 
(SEQ ID NO; 48) 



CAGCAATATC CTTTGTCAGC ATAATGTACT 
TTTGTGCATT CGTTGTTACA GTAATTG 
(SEQ ID NO: 49) 



GACAAAGGAT ATTGCTGCTT ACTTTCATOT 
TATTGCTTCG GTCTAAATOA CGATAAAAAA 
GXTTTG (SEQ ID NO: 50) 



CTTGTGTCCG AAATCTCCAA AACTTTTTTA 
TCGTCATTTA GACCGAAGCA ATAACATGAA 
AGTAAG (SEQ ID NO: 51) 



GAGATTTCGG ACACAAGGAA AAGTTATTGT 
GACACCACAA TAATTAATTA AG 
(SEQ ID NO: 52) 



Oligo 10 



AATTCTTAAT TAATTATTGT GGTGTCACAA 
TAACTTTTC (SEQ ID NO: 53) 
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In order to facilitate the cloning of the native AalT toxin gene cassette, a modified Bluescript SK 
piasmid is generated. This vector designated pBS SK Bsm I has a Bgl II linker (CAGATCTG) inserted into 
the Nae I site located at 330bp on the Bluescript map. This linker insertion destroys the Nae I site. In 
addition this vector contains a modified polylinker as shown at the top in Figure 3. This modified piasmid is 

5 designated pBS SK Bsm I. This polylinker contains a Nae I site located adjacent to a Bsm I site, such that 
any sequence cloned into the Nae I site can be cleanly excised as a blunt-ended molecule by using the 
Bsm I restriction enzyme followed by filling in of the sticky end with the Klenow fragment of DNA 
polymerase I (Gibco/BRL, Gaithersburg, MD). 

To construct the native signal sequence portion of the cassette, oligos 1 and 2 are annealed. When 

70 these two oligos are properly annealed, they contain a blunt 3' end and a Bam HI sticky end at the 5' 
terminus. The piasmid pBS SK Bsm I is digested with Bam HI and Nae I. The annealed oligos are then 
ligated to the digested piasmid. The resulting piasmid subclone is designated pBS Gill sig. 

The coding region of the AalT toxin is constructed as two pieces of DNA. The 5' ends of oligos 4, 5, 8 
and 9 are phosphorylated using T4 kinase. Oligos 3, 6, 7 and 10 are left without a terminal phosphate. Two 

t5 annealing reactions, followed by a ligation reaction using T4 DNA ligase (New England Biolabs, Beverly, 
MA) are carried out Oligos 3, 4, 5, and 6, which encode the amino terminal portion of the AalT toxin, are 
annealed and ligated as shown in Figure 3. Oligos 7, 8, 9 and 10, which encode the carboxy terminus of the 
AalT toxin, are annealed and ligated as shown in Figure 3. 

These two oligo fragments are purified by electrophoresis on a 2.5% low melt agarose gel (Bio-Rad, 

20 Richmond, CA) containing TAE (40 mM Tris-acetate, pH 7.8, 1 mM EDTA). When the fragments are 
sufficiently separated from contaminating fragments, they are cut out as gel slices and placed in individual 
tubes. The fragments are purified by phenol extraction as described in Sambrook et al. (32) with the 
following modifications: (1) Sodium chloride is added to the gel slices to a final concentration of 1.5 M prior 
to heating. No further addition of salt is required in the subsequent steps. (2) Only two phenol extractions 

25 are performed on the liquified and diluted gel slice prior to the ethanol precipitation. 

The piasmid pBS Gill sig is digested with Bsm I and the sticky end is filled in and blunted with the 
Klenow fragment of DNA polymerase I (Gibco/BRL). As described above, this results in opening the piasmid 
at the last base pair of the signal sequence. The piasmid is then further digested with Eco Rl. A three way 
ligation is then set up between the gel purified oligo fragments and the digested piasmid. Positive 

30 subclones are verified by restriction enzyme analysis, followed by DNA sequencing. One positive clone 
designated pBS GIII-AalT contains the native signal sequence and full length native coding region of AalT 
(Figure 3). 

Example 4 

35 

Insertion of AalT Gene Constructs into Baculovirus Transfer Vectors 

The heterologous signal sequence-codon optimized AalT gene cassettes of Example 2 are isolated as 
Bam HI fragments from the pBS signal-AalT clones (see pBS Cuticle-AalT example in Figure 2. These Bam 

40 HI fragments are subcloned into the pVL 985 baculovirus transfer vector DNA (31) which had been digested 
with Bam HI. Restriction enzyme analysis followed by sequencing of the insert is used to confirm the 
correct orientation and integrity of the pVL heterologous signal sequence-codon optimized AalT clones. 

For example, the piasmid pBS Cuticle-AalT is subcloned into the pVL baculovirus transfer vector to 
yield the piasmid designated pAC0055.l. 

45 The native AalT gene cassette of Example 3 is isolated as a Bam HI to Eco Rl fragment from the pBS 
GIII-AalT clone (see Figure 3). These Bam HI to Eco Rl fragments are subcloned into pVL 1393 baculovirus 
transfer vector DNA (33) which had been digested with Bam HI and Eco RL Restriction enzyme analysis 
followed by sequencing of the insert is used to confirm the integrity of the pVL native AalT clones. 

so Example 5 

Generation of Recombinant Viruses Encoding AalT 

Recombinant viruses containing the native and codon optimized AalT genes under the control of the 
55 AcMNPV polyhedrin promoter are generated by homologous DNA recombination in cultured Sf9 cells, as 
described by M. D. Summers and G. E. Smith (25). Sf9 cells (ATCC accession number CRL1711) are 
derivatives of the cell line designated Spodoptera frugiperda 21 (Sf21). 
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In this procedure, 2.0 x 10 6 Sf9 cells are seeded in a 60 mm culture dish in 5 ml of supplemented 
TNM-FH medium (Grace's insect medium (34) supplemented with 0.33% TC lactalbumin hydrolysate 
(Difco, Detroit, Ml) and 0.33% TC yeastolate (Difco)), 10% fetal bovine serum, 0.1% Pluronic™ F-68 
(Gibco/BRL)). Once the cells are firmly attached (2-16 hours), the medium is removed and replaced with 

5 0.75 ml of Graced insect medium supplemented with 10% fetal bovine serum. One microgram of AcMNPV 
(E2 strain) DNA is mixed with 2 ug of AalT-transfer vector DNA from Example 4 in 0.75 ml of transfection 
buffer (25 mM HEPES, pH 7.05, 140 mM NaCI, 125 mM CaCI 2 ) and added dropwise to the cells. The cells 
are then incubated at 27 'C for 4 hours. At the end of the incubation period, the transfection medium is 
removed and the cells are washed once with TNM-FH, fed again with 5 ml of supplemented TNM-FH, and 

70 placed in a 27 *C incubator. After five days, the medium is removed from the cells, clarified by 
centrifugation for 10 minutes at 2000 rpm in a Beckman GPR centrifuge, and stored at 4°C. This 
constitutes the primary transfection supernatant. 

Because the native and codon optimized AalT genes replace the AcMNPV polyhedrin gene, the 
recombinant viruses which carry these toxin genes are unable to form polyhedra and, therefore, give rise to 

75 occlusion-negative plaques. Using this phenotype as a basis for identification, recombinant viruses contain- 
ing the AalT genes are isolated from the primary transfection supernatant by three rounds of plaque 
purification, using the plaque assay method of M. D. Summers and G. E. Smith (25). 

In this procedure, 1.5-2.0 x 10 6 Sf9 cells are seeded into a 60 mm culture in supplemented TNM-FH. 
After the cells attach (2-16 hours), the medium is removed and replaced with 1 ml of supplemented TNM- 

20 FH containing 0.001-0.1% primary transfection supernatant. The virus is allowed to adsorb to the cells for 1- 
2 hours at 27 °C, after which it is removed and replaced with 4 ml of molten (39 *C) supplemented TNM-FH 
containing antibiotics and 1.5% low gelling temperature agarose. Once the agarose gels, the cells are 
transferred to an humidified 27 *C incubator for 4-6 days. Occlusion-negative plaques are then identified by 
visual inspection under a stereo microscope and an agarose plug overlying each desired plaque is picked 

25 and diluted into 1 ml of supplemented TNM-FH. 

The plaque purification procedure is repeated two additional times using 1-10% of the total virus 
recovered from each plaque. At the end of the third round, all of the virus recovered from a single plaque 
(i.e., 1 ml) is added to 2 x 10 6 Sf9 cells seeded in a 25 cm 2 flask and the flask is incubated at 27 °C for 1-2 
hours. The virus is then removed and replaced with 3 ml of supplemented TNM-FH, and the flask is 

30 returned to the incubator. At the end of 5 days, the supernatant, which is designated as the passage 1 or 
"P1" virus stock, is clarified by centrifugation for 10 minutes at 2000 rpm in a Beckman GPR centrifuge. 
Archive samples of the P1 stock are kept at -150 ° C, and the remainder of the virus is stored at 4 * C. 

Confirmation that the occlusion-negative viruses contain the desired AalT gene is achieved by dot blot 
hybridization for the codon optimized AalT-viruses (Example 6 below) and by polymerase chain reaction 

35 (PCR) for the native AalT-virus (Example 7 below). 

Example 6 

Confirmation of Recombinant Viruses By Dot Blot Hybridization 

40 

To confirm the presence of the codon optimized AalT gene in occlusion-negative viruses, virus-infected 
Sf9 cells from Example 5 are assayed by dot blot hybridization for nucleic acids that hybridize to a 
radiolabeled DNA probe specific for the AalT gene. For this method, the cell pellets obtained during 
clarification of the viral P1 stocks are each resuspended in 1 ml Dulbecco's Phosphate Buffered Saline (D- 

45 PBS) and 100 ml aliquots are removed for hybridization. The cells are pelleted in an Eppendorf centrifuge at 
maximum speed for 15 seconds and then solubilized in 0.5 ml 0.5 M NaOH. After 10 minutes at room 
temperature, each sample is neutralized with 50 ul of 10 M NH 4 OH. With the aid of a 96 well dot blot 
apparatus, 10 to 100 M.I of the solubilized cell pellet is passed through a 3 mm diameter circle on a 
nitrocellulose membrane (Schleicher and Schuell BA85, (Keene, NH)) equilibrated with 1 M NH 4 OH, 0.02 M 

50 NaOH. The wells of the blot apparatus are then washed with 0.5 ml 1 M NH*OH, 0.02 M NaOH. The filter is 
removed from the apparatus, rinsed with 5 x SSPE (0.9 M NaCI, 50 mM NaH 2 P04, 16.25 mM NaOH, 5 mM 
EDTA), and the nucleic acids are baked onto the membrane in a vacuum oven at 80 *C for 2-3 hours. 
Before use, the membrane is incubated overnight at 42 °C in prehybridization buffer (5 x SSPE, 50%(v/v) 
formamide, 0.2% SDS, 5 x Denhardt's (0.1% (w/v) polyvinylpyrrolidone, 0.1% (w/v) Ficoll 400™ (Pharmacia 

55 LKB, Piscataway, NJ), 0.1 % (w/v) bovine serum albumin)). 

The hybridization probe for codon optimized AalT gene constructs is a ^P-labeled double-stranded 
DNA prepared by annealing oligonucleotides E1 and E2 (described in Example 2) and then filling in the 
single-stranded regions with phage T7 DNA polymerase in the presence of 32 P-tabeled dATP and 
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uniabelled dCTP, dGTP and TTP. The radiolabeled probe is denatured by heating and then diluted in 
hybridization buffer (prehybridization buffer supplemented with 100 ug/ml sheared and denatured salmon 
sperm DNA). The membrane is placed in the probe-containing hybridization buffer and incubated at 37 "C 
for 16 hours. Unhybridized probe is then removed through four successive 20-30 minute washes: two at 
room temperature in 2 x SSPE, 0.2% SDS, and two at 65 *C in 0.2 x SSPE, 0.2% SDS. Excess fluid is 
removed from the membrane and the membrane-bound radioactivity is detected by autoradiography using 
Kodak XAR medical X-ray film in the presence of an Du Pont Cronex™ (Wilmington, DE) Lightning Plus 
intensifying screen. Using this assay, the presence of the AalT gene is confirmed in 93% of the putative 
codon optimized AalT-virus isolates examined. 

Example 7 

Confirmation of Recombinant Viruses Using the Polymerase Chain Reaction (PCR) 

To verify that the putative native AalT recombinant viruses contain an insert of the correct size, the 
distance between two known marker sites flanking the site of AalT gene insertion is measured by PCR 
analysis of budded virus. The procedure is a modification of that described by B. Maiitschek and M. Schartl 
(35). The primers for this analysis are located 84 to 110 bases upstream (5'-CAATATATAGTTGCT- 
GATATCATGGAG-3'; PVLForward) and 205 to 226 bases downstream (S'-GGATTTCCTTGAAGAGAGT- 
GAG-3'; PVLReverse) of the AcMNPV polyhedrin gene translational start site- 
To prepare virus samples for analysis, 4 ul aliquots of each P1 stock, and of the wild-type AcMNPV E2 
strain as a control, are each mixed in a GeneAmp tube (Perkin-Elmer/Cetus, Norwalk, CT) with 21 ill of 
Buffer A (10 mM Tris-HCI (pH 8.3), 40 mM KCI, 0.1 mg/ml gelatin, 0.45%(v/v) of Nonidet™ P40 (Shell Oil 
Co.), 0.45% (v/v) Tween™ 20 (ICI Americas)) supplemented with 200 ug/ml of a non-specific protease from 
Streptomyces griseus . Each tube is then incubated in a Perkin-Elmer/Cetus DNA Thermal Cycler running a 
single cycle three temperature program: (1) 55 *C for 60 minutes, (2) 95 «C for 12 minutes, and (3) soak at 
4°C. The function of this program is to liberate the viral DNA from the budded virus and to destroy the 
pronase prior to the PCR amplification step. 

Each sample is prepared for the amplification reaction by adding 24.5 ul of a solution containing 1 x 
Buffer A, 3.0 mM MgCI 2 , 400 UM dATP, 400 RM dGTP, 400 U.M dCTP, 400 U-M TTP, 50 pmol primer 1 
(PVLForward), 50 pmol primer 2 (PVLReverse). The PCR is initiated by adding 0.5 u\ (2.5 units) of 
AmpliTaq DNA polymerase (Perkin-Elmer/Cetus) and starting a thermal cycler program consisting of 25-35 
temperature cycles, wherein each cycle consists of (1) 1 minute at 94 "C, (2) 1.5 minutes at 55 *C, and (3) 
2.5-3.0 minutes at 72 °C. On the last cycle, the incubation time at 72 °C is extended an additional 7 
minutes, and the samples are then soaked at 4°C. An aliquot of each PCR reaction is then analyzed by gel 
electrophoresis in the presence of a suitable set of DNA size markers. 

Each of four different putative native AalT recombinant viruses tested by this method yields a single 
PCR product, the mobility of which is identical among the different samples and commensurate with the 
predicted size of 493 bp for the fragment being amplified. This fragment contains the primers flanking the 
site of gene insertion. Similarly, a single PCR product is obtained with the wild-type AcMNPV strain E2 
virus, and its mobility is in good agreement with the predicted size of 337 bp. 

Example 8 

Analysis of RNA Levels in Virus- Infected Cells 

Expression of the AalT gene in Sf9 cells infected with the codon optimized and native AalT-viruses is 
estimated by Northern blot analysis of total cellular RNA isolated from virus-infected cells. Total cell RNA is 
isolated by the guanidine thiocyanate/CsCI procedure (36,37). 

In this procedure, Sf9 cells (5 x 10 7 in a 150 cm 2 flask) are infected with virus at an MOl of 5 PFU/cell 
and incubated at 27 °C for 24 hours. The cells are then harvested by centrifugation (2000 rpm for 10 
minutes in a Beckman GPR centrifuge) and the cell pellet is resuspended in 10 ml of 4 M guanidine 
thiocyanate solution (50% (w/v) guanidine thiocyanate, 0.5% (w/v) N-laurylsarcosine, 07% (v/v) 0-mercap- 
toethanol, 25 mM sodium citrate (pH 7.0)). Total RNA is then partially purified by sedimentation through a 
5.7 M CsCI, 0.1 M EDTA pad in a Beckman 70.1 Ti rotor spun at 60,000 rpm for 4.5 hours. The RNA pellet 
is redissolved in 2 ml TES buffer (10 mM Tris-HCI (pH 7.5), 5 mM EDTA, 1% (w/v) SDS) and incubated for 
5-10 minutes at 55 "C to aid dissolution of the sample. The solution is then adjusted to 0.15 M NaCI and 
extracted once with water-saturated phenol. The aqueous phase is removed and the organic phase is re- 
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extracted twice more with 2 mi TES, 0.15 M NaCI. The aqueous phases are then combined and the RNA is 
precipitated with ethanol. The RNA pellet is redissolved in 2 ml H 2 0 and reprecipitated overnight at 4-C 
with an equal volume ol 4 M LiCI. The pellet is then washed once with ice cold 2 M LiCI and redissolved in 
0.5 ml H 2 0. Residual LiCI is then removed by one last ethanol precipitation and the final pellet is 

5 redissolved in 0.2 ml H 2 0. The RNA yield is determined by UV spectroscopy. 

For Northern analysis 20 ug of each RNA is size fractionated by electrophoresis on a 1% agarose gel 
containing 2.7 M formaldehyde, 40 mM MOPS (4-morpholinepropanesulfonic acid) (pH 7.0), 10 mM sodium 
citrate, 1 mM EDTA. The RNA is then transferred by capillary blotting to a nitrocellulose membrane 
(Schleicher and Schuell BA85) in the presence of 20 x SSC (3 M NaCI, 0.3 M sodium citrate; adjusted to 

70 pH 7.0 with NaOH). The membrane is prehybridized briefly in hybridization buffer (50% (v/v) formamide, 0.9 
M NaCI, 50 mM sodium phosphate (pH 7.0), 5 mM EDTA, 0.1% (w/v) SDS, 4 x Denhardt's, 0.4 mg/ml 
tRNA, 0.25 mg/ml calf thymus DNA) and then hybridized for 16 hours at 42 *C in the same buffer 
containing a ^P-labeled DNA probe prepared by random priming (38) a 464 bp Bam HI to Kpn I restriction 
fragment lying immediately 3' to the site of AalT gene insertion in the AcMNPV polyhedrin gene. Following 

15 hybridization, unbound probe is removed by four successive 30 minute washes with 0.25 x SSC, 0.1% SDS 
at 65 *C. A digitized image of the membrane-bound probe is produced by autoradiography using a 
Molecular Dynamics Phosphorlmager™ , and individual bands are quantified using ImageQuant™ v3.15 
software (Molecular Dynamics, Sunnyvale, CA). 

Figure 4 summarizes an analysis of AalT/polyhedrin RNA levels in Sf9 cells infected with wild-type 

20 AcMNPV (labelled "wild-type") and with the codon optimized (labelled with the specific heterologous signal 
sequence) and native (labelled grp III AalT) recombinant viruses. Only one species of polyhedrin-containing 
RNA is detected in each sample and all are of a size which is appropriate for the structure of the polyhedrin 
or AalT/polyhedrin gene present in the virus. The amount of AalT/polyhedrin RNA detected in the codon 
optimized AalT-virus-infected cells ranges from 45-67% of the level of polyhedrin RNA accumulated in wild- 

25 type AcMNPV-infected cells. The corresponding value for native AalT-virus-infected cells is 30%. This 
analysis shows that the AalT/polyhedrin gene is correctly utilized and highly active in AalT-virus-infected 
Sf9 cells. 

Example 9 

30 

Production of Biologically Active AalT in Virus-Infected Sf9 Cells 

AalT causes an acute excitatory (contractile) paralysis when 1 ng or more is injected into the 
dorsolateral area of Sarcophaga argyrostoma (flesh fly) larvae (39) A similar assay, using Musca domestica 
35 (house fly) larvae, is used to determine whether a detectable amount (1 ng) of biologically active AalT is 
present in either the culture medium or cell pellets obtained during the preparation of the codon optimized 
AalT viral P1 stocks. 

To assay the culture medium, 2-6 ul of each P1 stock (600-1000 cell equivalents/til) are injected into 
the dorsolateral area M. domestica larvae using a Hamilton syringe equipped with a 26 gauge needle. 
40 Larvae are examined five seconds after injection for evidence of involuntary contraction of body segments. 
Five larvae are assayed for each virus. 

To assay AalT activity in the cell pellets, the virus 7 infected Sf9 cells are washed once by centrifugation 
and the resuspended at a density of 30,000 cells per microliter in Dulbecco's phosphate buffered saline (D- 
PBS). An aliquot of the cell suspension is removed and frozen on dry ice and then thawed at 37 ' C three 
45 times in quick succession. Two microliters of the freeze/thaw lysate (i.e., 60,000 cell equivalents) are then 
injected into M. domestica larvae as described above and examined for acute contraction of body 
segments. 

The table below summarizes the results obtained with an interleukin-2 signal-AalT-virus mutant 
(containing a frame shift mutation) and three heterologous signal-codon optimized AalT-virus isolates. 

50 Biologically active AalT cannot be detected in any of the P1 stocks (cell culture supernatants) tested in this 
assay, indicating that the concentration of active toxin is in the range of 0.2 ng/nl or less in the 
supernatants. In contrast, AalT is easily detected in 13 of 17 cell pellet lysates. Three of the four negative 
lysates are derived from the mutant IL2-AalT-virus isolates, which contain a frame shift mutation at position 
265 in the toxin gene coding region. The other negative is obtained with isolate T9.4.1 (putative Cuticle- 

55 AalT), which is negative by dot blot hybridization for the AalT gene insert. The other four putative Cuticle- 
AalT constructs give positive results, as do all nine of the Esterase-6-AalT and adipokinetic hormone-AalT 
constructs. 
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Example 10 

Analysis of Virus Performance by Injection into Larvae 

To test whether insertion of the AalT gene enhances AcMNPV performance, budded virus prepared 
from the codon optimized Cuticle-AalT and from the wild-type E2 strain of AcMNPV without the AalT gene 
are bioassayed by injection into mid-fourth instar Heliothis virescens (tobacco bud worm) larvae. Each virus 
is titered by the plaque assay method, as described by M. D. Summers and G. E. Smith (25), and then 
diluted to 2 x 10 7 , 2 x 10 e , and 2 x 1tf> PFU/ml in TNM-FH medium supplemented with 0.5% (v/v) red dye 
number 5. Each larva is anesthetized with carbon dioxide for 2-5 minutes and then injected with 0.5 nl of 
diluted virus, using a Hamilton syringe equipped with a 26 gauge needle. The needle is inserted 
longitudinally between the last two prolegs and then moved anteriorly two to three body segments prior to 
injection. Following injection, each larva is inspected for the release of dye-stained hemolymph and 
discarded if sample loss is evident or suspected. The larvae are then stored at 27 *C in covered 4 cm 2 diet 
ceils (one larva per cell) and inspected 2-3 times daily for evidence of morbidity or mortality. An individual 
is scored as moribund (positive response) if it is unable to right itself within 0.5-2 minutes after being turned 
on its back. 

Figure 5 summarizes the results obtained when H. virescens larvae are injected with 10 2 , 10 3 and 10* 
PFU of each virus. Forty-eight larvae are used for each virus dose. Eighty insects ("No Virus") are injected 
with TNM-FH alone as a negative control. The results show that the- Cuticle-AalT AcMNPV kills its host 
faster than the wild-type AcMNPV at each of the doses tested. At 10* PFU, the LTso for the wild-type virus 
is approximately 97 hours, whereas the corresponding LT 5 o for the Cuticle-AalT AcMNPV is 67 hours. 
Moreover, virtually all responding larvae infected with Cuticle-AalT, but not those infected with the wild-type 
AcMNPV, suffer from contractile paralysis prior to death. This result shows that insertion of the AalT gene 
and the cuticle signal sequence into AcMNPV accelerates the speed of kill through the expression of 
biologically active toxin. 
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Example 11 

Further Analysis of Virus Performance by Injection into Larvae 

5 The procedure of Example 10 is repeated by injecting separate groups of early fifth instar Heliothis 
virescens larvae with 10 4 PFU of budded virus prepared from either the wild-type E2 strain of AcMNPV 
without the AalT gene, a native AalT signal-native AalT gene-AcMNPV construct, or one of seven different 
codon optimized heterologous signal-AalT-AcMNPV constructs. A given virus is administered to 32 larvae. 
Three separate assays are conducted. Larvae damaged by reason of the injection are not included in the 

70 analysis. The data are pooled and the results are depicted in the following Table: 
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In the Table, "n n represents the total number of larvae from the three assays, "df" represents the 
55 degrees of freedom, "N/A M means not applicable, and "L1(95%)" and "L2(95%)" represent the confidence 
limits of the data, such that if the assay was rerun with an equal number of larvae, there would be a 95% 
confidence level that the LTso would fall within the values shown for L1 and L2. 
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The results show that each of the codon optimized heterologous signal-AalT-AcMNPV constructs (as 
well as the native AalT signal-native AalT gene-AcMNPV construct) kills its host faster than the wild-type 
AcMNPV. The LT 50 for the wild-type virus is approximately 126 hours, whereas the corresponding LT 50 for 
the seven heterologous signal-AalT-AcMNPV constructs range from approximately 68-89 hours and the 
native AalT-AcMNPV construct has an LT 5 o of approximately 74 hours. These results confirm the results of 
Example 10 that insertion of the AalT gene and a heterologous signal sequence into AcMNPV accelerates 
the speed of kill through the expression of biologically active toxin. 

Example 12 

Co-Occlusion of Wild-Type and Occlusion-Negative Recombinant Viruses In Viral Polyhedra 

Because the recombinant AalT-viruses are defective for the production of polyhedrin, polyhedra 
containing a mixture of wild-type and recombinant virions are prepared by co-infecting Sf9 cells with the 
recombinant AalT-virus and a wild-type helper virus (i.e., the E2 strain of AcMNPV). To determine the 
amount of each virus to use for co-infection, 2 x 10 7 Sf9 cells are seeded in a 150 cm 2 flask and infected 
with various amounts of the wild-type and recombinant viruses, each at a multiplicity of infection (MOI) of at 
least 1 plaque forming units (PFU) per cell Four days later, the cells are counted and the percentage of 
cells with clearly discernible viral occlusions in the cell nucleus is determined by microscopic visualization. 
The total number of polyhedra is also determined by counting polyhedra in samples treated with 0.2% 
Triton X-100 and 2% (w/v) SDS. The results of one such analysis involving the co-occlusion of wild-type E2 
AcMNPV and the cuticle-AalT recombinant AcMNPV are summarized in the following Table: 



Culture No. 


MOI [WT:Cut-AalT] 


Total cells x 10 7 


Percent of cells 
with occlusions 


Total polyhedra 
x10 8 
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1 
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2.30 
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3.30 


14.3 


4 
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1.85 


74% 


2.15 


11.6 


5 


2:10 


2.00 


36 % 


0.55 


2.75 


6 


2:20 


2.10 


23% 


0.12 


0.57 



These results demonstrate that even when the MOI of the wild-type virus is at least 2 PFU per cell, both 
the percentage of cells with viral polyhedra (occlusions) and the average number of polyhedra per cell 
decrease significantly as the MOI of the recombinant virus is increased. To maintain a large fraction of 
recombinant virions per polyhedron without making unacceptable sacrifices in the yield of polyhedra, all 
subsequent co-occlusion experiments are done with both viruses (i.e., wild-type and occlusion-negative 
recombinant) at an MOI of 3 each. 

Example 13 

Microdrop Bioassay to Measure the Oral Activity of Codon Optimized and Native Sequence Recombinant 
AalT-Viruses Co-occluded into Polyhedra with Wild-Type AcMNPV 

To examine the oral toxicity of the recombinant AcMNPV containing a codon optimized or native 
sequence encoding AalT, each virus (with the exception of the »L-2/AalT recombinant, which is not assayed) 
is co-occluded with wild-type AcMNPV into viral polyhedra (see Example 12) and fed to second instar 
Heliothis virescens larvae using a microdrop bioassay procedure as follows. Individual second instar 
Heliothis virescens larvae are transferred into empty assay wells containing a disk of prewetted filter paper. 
One larva is placed per well. The larvae are then stored at 26 'C, 50% relative humidity for 12 to 20 hours 
(overnight). 

The next day, a 15 ml aliquot of Stoneville insect diet (40) is heated to boiling. Five mis of water are 
added to the melted diet. This mixture is reheated to boiling and promptly centrifuged at low speed. 
Supernatant is removed and a few drops of food dye are added to the clarified supernatant as an aid to 
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visualize the diet. The molten diet is aliquoted into microfuge tubes, respun in a Beckman microfuge, and 
the clarified supernatant is transferred into new microfuge tubes. This molten diet is placed in a heat block 
at 54 ° C until needed. 

After the diet cools to 54 *C, the desired amount of PIBs is added. Typically, this ranges from 10 to 250 

5 PIBs per nl After mixing well, 1 n\ drops of treated diet are aliquoted onto PARAFILM™ (American 
National Can, Greenwich, CT), where they harden. The hardened drops are quickly transferred to the wells 
containing the individual larvae, one drop per larva. The larvae are allowed to feed for 2 hours and those 
larvae which consume the entire microdrop are placed into a new assay well containing a standard amount 
of untreated Stoneville diet. Larvae are then placed at 26 °C, 50% relative humidity, and monitored twice 

70 daily for paralysis and death. Paralysis is determined by rolling larvae onto their backs and observing for 30 
seconds. If the larva remains on its back, it is considered moribund. Moribund larvae generally die during 
the 24 hours following diagnosis. The duration of the test is generally 8 days. 

Larvae are inspected two to three times daily and scored for evidence of morbidity and mortality. Both 
dead and moribund larvae are scored as responding to the treatment. Figures 6-15 summarize the results 

75 obtained with each of the viruses. 

With one exception, all of the recombinant viruses demonstrate an earlier onset of response than the 
wild-type AcMNPV (which lacks the AalT gene). The exception is the AcMNPV construct containing the 
pBMHPC-12 signal sequence linked to the codon optimized cDNA encoding AalT, which performs 
approximately the same as the wild-type AcMNPV. The actual ratio of wild-type to recombinant virus in the 

20 polyhedra and in the infected insects is not analyzed. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: American Cyanamid Company 

(A) ADDRESSEE: American Cyanamid Company 

(B) STREET: One Cyanamid Plaza 

(C) CITY: Wayne 

(D) STATE: New Jersey 

(E) COUNTRY: USA 
(P) ZIP : 07470-8426 

(ii) TITLE OF INVENTION: Codon Optimized DNA Sequence For Insect 
Toxin AalT 

(iii) NUMBER OF SEQUENCES: 53 

(v) COMPUTER READABLE FORMt 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER i IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Wachtershauser, Dr. Gunter 

(C) REFERENCE/DOCKET NUMBER: EA- /31867-OQ 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (089) 29 39 06 

(B) TELEFAX: (089) 22 37 59 

(C) TELEX: 521 41 73 Patw-D 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australia Hector 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..213 



20 



25 



30 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:li 

AAG AAG AAT GGA TAT GCC GTC GAT AGT AGT GGT AAA GOT CCT GAA TGT 48 
Lys Lya Aan Gly Tyr Ala Val Asp Ser Ser Gly Lys Ala Pro Glu Cys 
1 5 10 15 

CTT TTG AGC AAT TAC TGT AAC AAC GAA TGC ACA AAA GTA CAT TAT GCT 96 
Leu Leu Ser Aan Tyr Cys Aan Aan Glu Cya Thr Lye Val His Tyr Ala 
20 25 30 

GAC AAA GGA TAT TGC TGC TTA CTT TCA TGT TAT TGC TTC GGT CTA AAT 144 
Asp Lys Gly Tyr Cys Cys Leu Leu Ser Cys Tyr Cys Phe Gly Leu Asn 
35 40 45 

GAC GAT AAA AAA GTT TTG GAG ATT TCG GAC ACA AGG AAA AGT TAT TGT 192 
Asp Asp Lys Lys Val Leu Glu lie Ser Asp Thr Arg Lys Ser Tyr Cys 
50 55 60 

GAC ACC ACA ATA ATT AAT TAA 213 
Asp Thr Thr lie lie Asn 
65 70 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Lys Lys Asn Gly Tyr Ala Val Asp Ser Ser Gly Lys Ala Pro Glu Cys 
1 5 10 15 

Leu Leu Ser Asn Tyr Cys Aan Asn Glu Cys Thr Lys Val His Tyr Ala 

40 20 25 30 

Asp Lye Gly Tyr Cys Cya Leu Leu Ser CyB Tyr Cya Phe Gly Leu Asn 
35 40 45 



Asp Asp Lys Lys Val Leu Glu lie Ser Asp Thr Arg Lys Ser Tyr Cys 



45 



50 



55 
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50 



55 



60 



Asp Thr Thr lie Xle Aflii 
65 70 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO ' 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Austral is Hector 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.*213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAO AAG AAC GOC TAC OCA GTC GAC TCA TCC GGA AAA GCC CCC GAG TGC 
Lys Lys Asn Gly Tyr Ala Val Asp Ser Ser Gly Lys Ala Pro Glu Cys 
1 5 10 15 

CTG CTC TCG AAC TAT TGC AAC AAT GAA TGC ACC AAG GTG CAC TAC GCT 
Leu Leu Ser Asn Tyr Cys Asn Aon Glu Cys Thr Lys Val His Tyr Ala 
20 25 30 

OAC AAG OGC TAC TGT TGC CTT CTG TCC TGC TAT TGC TTC GOT CTC AAC 
Asp Lys Gly Tyr Cyo Cys Leu Leu Ser Cys Tyr Cys Phe Gly Leu Asn 
35 40 45 

GAC OAC AAG AAA GTT CTG GAA ATC TCT GAT ACT CGC AAG AGC TAC TGT 
Asp Asp Lys Lys Val Leu Glu lie Ser Asp Thr Arg Lys Ser Tyr Cys 
50 55 60 

GAC ACC ACC ATC ATT AAC TAA 
Asp Thr Thr lie He Asn 
65 70 



<2) INFORMATION FOR SEQ ID NO:4: 
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20 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Lys Asn Gly Tyr Ala Val Asp Ser Ser Gly Lys Ala Pro Glu Cys 
1 5 10 15 

Leu Leu Ser Asn Tyr Cys Asn Asn Glu Cys Thr Lys Val His Tyr Ala 
20 25 30 

Asp Lys Gly Tyr Cys Cys Leu Leu Ser Cys Tyr Cys Phe Gly Leu Aen 
35 40 45 

Asp Asp Lys Lys Val Leu Glu He Ser Asp Thr Arg Lys Ser Tyr Cys 
50 55 €0 

Asp Thr Thr He He Asn 
65 70 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Bombyx mori 

(ix) FEATURE: 

(A) NAME/KEY; CDS 

(B) LOCATION: 13.. 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCCCCCGOAT CC ATG AAA CTC CTG GTC GTG TTC GCC ATG TGC GTG CCC 48 
Met Lys Leu Leu Val Val Phe Ala Met Cys Val Pro 
1 5 10 
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OCT GCC AGC GCT 
Ala Ala Ser Ala 
15 



(2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Leu Leu Val Val Phe Ala Met Cy» Val Pro Ala Ala Ser Ala 
1 5 10 15 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bombyx mori 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12.. 59 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CTCCTAACAA A ATG AAA CTT CTC GTT GTG TTC GCA ATG TGC GTG CCT GCC 50 
Met Lys Leu Leu Val Val Phe Ala Met Cys Val Pro Ala 
15 10 

GCC AGC GCC 59 
Ala Ser Ala 
15 
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(2) INFORMATION FOR SEQ ID NOtSi 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein' 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

10 

Met Lys Leu Leu Val Val Phe Ala Met Cys Val Pro Ala Ala Ser Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 9: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA (genomic) 



ID 



20 



(i) 



(ii) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Manduca sexta 

(ix) FEATURE; 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 69 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CCCCCCGGAT CC ATG TAG AAA CTO ACC GTC TTC CTG ATG TTC ATC GCC 48 
Met Tyr Lys Leu Thr Val Phe Leu Met Phe lie Ala 
1 5 10 

TTC GTG ATT ATC GCT GAG GCC 69 
Phe Val lie lie Ala Glu Ala 
15 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPB: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Tyr Lye Leu Thr Val Phe Leu Met Phe He Ala Phe Val He He 
1 5 10 15 

Ala Glu Ala 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Manduca sexta 

( ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 20.. 76 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTCGATATCA TCAATCAAG ATG TAC AAG CTC ACA GTC TTC CTG ATG TTC ATC 52 
Met Tyr Lys Leu Thr Val Phe Leu Met Phe He 
1 5 10 

35 GCT TTC GTC ATC ATC GCT GAG GCC 76 
Ala Phe Val He He Ala Glu Ala 
15 



15 



20 



25 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(i±) MOLECULE TYPE: protein 

(xi) SEQUBNCE DESCRIPTION: SEQ ID NO: 12: 

Met Tvx Lys Leu Thr Val Phe Leu Met Phe lie Ala Phe Val He He 
15 10 15 

Ala Glu Ala 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHt 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKDNESS i single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Manduca sexta 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 81 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCCCCCGGAT CC ATG GCC GCT AAA TTC GTC GTG GTT CTG GCC GCT TGC 
Met Ala Ala Lys Phe Val Val Val Leu Ala Ala Cys 
1 5 10 

GTC GCC CTG AGC CAC TCG GCT ATG GTG CGC CGC 
Val Ala Leu Ser His Ser Ala Met Val Arg Arg 
15 20 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Met Ala Ala Lys Phe Val Val Val Leu Ala Ala Cys Val Ala Leu Ser 
1 5 10 15 

5 His Ser Ala Met Val Arg Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
™ (A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDKDNESS: single 
(D) TOPOLOGY: linear 



15 



<ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL t NO 

(iv) ANTI- SENSE: NO 

<vi) ORIGINAL SOURCE i 
20 (A) ORGANISM: Manduca sexta 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 21.. 89 



25 



30 



40 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCCAGTCCAG TCACTTCATC ATG GCA GCC AAG TTC GTC GTG GTT CTC GCC 50 

Met Ala Ala Lys Phe Val Val Val Leu Ala 
1 5 10 

GCG TGC GTG GCC CTC TCG CAC AGC GCG ATG GTG CGC CGC 89 
Ala Cys Val Ala Leu Ser His Ser Ala Met Val Arg Arg 
15 20 

(2) INFORMATION FOR SEQ ID NO:lS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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Met Ala Ala Lys Phe Val Val Val Leu Ala Ala Cys Val Ala Leu Ser 
1 5 10 15 

His Ser Ala Met Val Arg Arg 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bombyx mori 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 75 



25 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCCCCCGGAT CC ATG TTC ACC TTC OCT ATT CTG CTC CTG TGC GTG CAA 48 
Met Phe Thr Phe Ala lie Leu Leu Leu Cys Val Gin 
1 5 10 

30 qqc TGC CTG ATC CAG AAT GTT TAC GGA 75 
Gly Cys Leu lie Gin Asn Val Tyr Gly 
15 20 

(2) INFORMATION FOR SEQ ID NO: 18: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 



40 



45 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IB: 
Met Phe Thr Phe Ala lie Leu Leu Leu Cys Val Gin Gly Cys Leu He 



50 
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15 10 15 

Gin Asn Val Tyr Gly 
20 

(2) INFORMATION FOR 3EQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE t 

(A) ORGANISM: Bombyx mori 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 14.. 7 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 AATATCCAGC ATC ATG TTT ACC TTC OCT ATT CTC CTT CTC TGC GTT CAG 49 
Met Phe Thr Phe Ala lie Leu Leu Leu Cys Val Gin 
15 10 

GGT TGC CTG ATC CAA AAT GTG TAC GOT 76 
Gly Cys Leu lie Gin Asn Val Tyr Gly 

30 15 20 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt protein 

40 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

Met Phe Thr Phe Ala lie Leu Leu Leu Cys Val Gin Gly Cys Leu He 
15 10 15 



45 



50 



55 



36 



EP 0 621 337 A1 



Gin Asn Val Tyr Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Droaophila melanogaster 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 60 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 

CCCCCCGGAT CC ATG TTC AAG TTC GTG ATG ATC TGC GCC GTC CTC GGC 
Met Phe Lys Phe Val Met lie Cyo Ala Val Leu Gly 
1 5 10 

CTG GCT GTG GCC 
Leu Ala Val Ala 
15 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Phe Lye Phe Val Met lie Cys Ala Val Leu Gly Leu Ala Val Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 23: 
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75 



20 



30 



35 



45 



50 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 48 base pairs 

(B) TYPE; nucleic acid 
<C) STRANDEDNESS x single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi> ORIGINAL SOURCE; 

(A) ORGANISM: Droaophila melanogaster 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .48 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG TTC AAG TTT GTC ATG ATC TGC GCA GTT TTG GGC CTG GCG GTG GCC 48 
Met Phe Lys Phe Val Met He Cys Ala Val Leu Gly Leu Ala Val Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRXPTION: SEQ ID NO: 24: 

Met Phe Lys Phe Val Met He Cys Ala Val Leu Gly Leu Ala Val Ala 

1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 25 x 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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75 



(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 75 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CCCCCCGGAT CC ATG AAC TAC GTC GGG CTG GGC CTC ATC ATT GTG CTG 48 
Met Ann Tyr Val Gly Leu Gly Leu He He Val Leu 
1 5 10 

TCG TGC TTG TGG CTG GGG AGC AAT GCT 75 
Ser Cys Leu Trp Leu Gly Ser Asn Ala 
15 20 



20 (2) INFORMATION POR SEQ ID NO: 26: 

(1) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 



40 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Ann Tyr Val Gly Leu Gly Leu He He Val Leu Ser Cys Leu Trp 

15 10 15 

Leu Gly Ser Asn Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



50 



55 



39 



EP 0 621 337 A1 



(iv) ANTI- SENSE : NO 

(v±) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 25.. 87 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:27: 

GAATTCGCCG GAGTGAGGAG CAAC ATG AAC TAC GTG GGA CTG GOA CTT ATC 51 

Met Asn Tyr Val Gly Leu Gly Leu He 
1 5 

ATT GTG CTG AGO TGC CTT TGG CTC GGT TCG AAC GCG 87 
He Val Leu Ser Cys Leu Trp Leu Gly Ser Asn Ala 
10 15 20 

(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Aen Tyr Val Oly Leu Gly Leu He He Val Leu Ser Cys Leu Trp 
15 10 15 



75 



20 



30 



45 



50 



55 



Leu Gly Ser Aen Ala 
20 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single (D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE; 
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(A) ORGANISM: Bombyx mori 

(ix) FEATURE: s 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 57 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CCCCCCGGAT CC ATO COC QTC CTG OTO CTG TTG GCC TGC CTG GCA GCC 
Met Arg Val Leu Val Leu Leu Ala Cys Ala Ala 

1 5 10 

GCT AGC GCT 
Ala Ser Ala 
15 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:30: 

Met Arg Val Leu Val Leu Leu Ala Cye Leu Ala Ala Ala Ser Ala 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bombyx mori 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 21. .65 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



CACTCAGGAG TTCTTCAAAC ATO AGO GTT CTA GTA CTA CTG GCC TGC TTG 

Met Arg Val Leu Val Leu Leu Ala Cys Leu 
15 10 



50 



5 



GCC GCG GCG TCA GCC 
Ala Ala Ala Ser Ala 
15 



65 



70 



(2) INFORMATION FOR SEQ ID NO: 32: 



75 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



Ui) 



MOLECULE TYPE: protein 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



20 Met Arg Val Leu Val Leu Leu Ala Cys Leu Ala Ala Ala Ser Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 
2 5 '' (A) LENGTH: 100 base pairs 

(B) TYPE; nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA (genomic) 

30 

(iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
qf . (A) ORGANISM: Scorpion Androc tonus Australis Hector 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AGCCCCCGAG TGCCTGCTCT CGAACTATTG CAACAATGAA TGCACCAAGG TGCACTACGC 60 
TGACAAGGGC TACTGTTGCC TTCTGTCCTG CTATTGCTTC 100 
(2) INFORMATION FOR SEQ ID NO: 34: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE; DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE; NO 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM: Scorpion Androctonus Australia Hector 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CTGTAGGTAC CGGATCCTTA GTTAATGATG GTGGTGTCAC AGTAGCTCTT GCGAGTATCA 60 
GAGATTTCCA GAACTTTCTT GTCGTCGTTG AGACCGAAGC AATAGCAGGA 110 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androctonus Austral is Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

35 

AGGCACTCGG GGGCTTTTCC GGATGAGGTC GACTGCGTAG CCGTTCTTCT T 51 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCCCCCGGAT CCATGTACCG CATGCAGCTG CTCTCCTGCA TCGCCCTGTC GCTGGCTCTG 60 
GTGACCAATA GCAAGAAGAA CGGCTAC 8 7 

(2) INFORMATION FOR SEQ ID NO:37: 

(1) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Manduca Sexta 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37; 
CCCCCCGGAT CCATGTACAA ACTGACCGTC TTCCTGATGT TCATCGCCTT CGTGATTATC 60 
GCTGAGGCCA AGAAGAACGG CTAC 84 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS % 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM i Bombyx mori 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCCCCCGOAT CCATGTTCAC CTTCGCTATT CTGCTCCTGT GCGTGCAAGG CTGCCTGATC 60 
CAGAATGTTT ACGGAAAGAA GAACGGCTAC 90 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
CCCCCCGGAT CCATGAACTA CGTCGGGCTG GGCCTCATCA TTGTGCTGTC GTGCTTGTGG 60 
CTGGGGAGCA ATGCTAAGAA GAACGGCTAC 90 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Manduca eexta 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCCCCCGGAT CCATGGCCGC TAAATTCGTC GTGGTTCTGG CCGCTTGCGT CGCCCTGAGC 60 
CACTCGGCTA TGGTGCGCCG CAAGAAGAAC GGCTAC 96 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bombyx mori 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CCCCCCGGAT CCATGCGCGT CCTGGTGCTG TTGGCCTGCC TGGCAGCCGC TAGCGCTAAG 60 

AAGAACGGCT AC 72 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 75 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Drosophila melanogaster 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CCCCCCGGAT CCATGTTCAA GTTCGTGATG ATCTGCGCCG TCCTCGGCCT GGCTGTGGCC 60 
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AAGAAGAACG GCTAC 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE i DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Bombyx mori 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCCCCCGGAT CCATOAAACT CCTGGTCGTG TTCGCCATGT GCGTGCCCGC TGCCAGCGCT 
AAGAAGAACG GCTAC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI - SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australia Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GATCCGATGA AATTTCTCCT ATTGTTTCTC GTAGTCCTTC CAATAATGGG GGTGCTTGGC 

(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE s DMA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Austral is Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 
GCCAAGCACC CCCATTATTG GAAGGACTAC GAGAAACAAT AGGAGAAATT TCATCG 56 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androctonus Australia Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
AAGAAGAATG GATATGCCGT CGATAGTAGT GGTAAAGCTC 40 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE; NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australia Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
CTCAAAAGAC ATTCAGGAGC TTTACCACTA CTATCGACGG CATATCCATT CTTCTT 56 
(2) INFORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNE S3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australia Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CTGAATGTCT TTTOAOCAAT TACTGTAACA ACGAATGCAC AAAAGTACAT TATGCT 56 

(2) INFORMATION FOR SEQ ID N0:49: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

35 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
4Q (A) ORGANISM: Scorpion Androc tonus Austral is Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
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CAGCAATATC CTTTGTCAGC ATAATGTACT TTTGTGCATT CGTTGTTACA GTAATTQ 57 
(2) IKFORMATION FOR SBQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australia Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GACAAAGQAT ATTGCTGCTT ACTTTCATGT TATTGCTTCG GTCTAAATGA CGATAAAAAA 60 

GTTTTG ^6 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH : 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androc tonus Australis Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CTTGTGTCCG AAATCTCCAA AACTTTTTTA TCGTCATTTA GACCGAAGCA ATAACATGAA 60 
AGTAAG 66 
(2) INFORMATION FOR SEQ ID NO: 52: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

w (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androctonue Auatralis Hector 

75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

GAGATTTCGG ACACAAGGAA AAGTTATTGT GACACCACAA TAATTAATTA AG 52 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scorpion Androctomis Aus trails Hector 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
AATTCTTAAT TAATTATTGT GGTGTCACAA TAACTTTTC 39 



Claims 

40 1. An isolated codon optimized nucleic acid sequence encoding Androctonus australis insect toxin (AalT), 
wherein said sequence is that depicted in SEQ ID NO. 3. 

2. An expression vector which contains the nucleic acid sequence of Claim 1 . 

45 3. The expression vector of Claim 2 wherein the expression vector is an insect virus. 

4. The expression vector of Claim 3 wherein the insect virus is selected from the group consisting of 
nuclear polyhedrosis viruses, granulosis viruses, non-occluded viruses and entomopox viruses. 

so 5. The expression vector of Claim 4 wherein the insect virus is a: (a) nuclear polyhedrosis virus selected 
from the group consisting of Lymantria dispar NPV (gypsy moth NPV), Autographa californica MNPV, 
Syngrapha falcifera NPV (celery looper NPV), Spodoptera litturalis NPV. Spodoptera frugiperda NPV, 
Heliothis armigera NPV, Mamestra brassicae NPV, Choristoneura fumiferana NPV, Trichoplusia ni NPV 
and Heliocoverpa zea NPV; (b) granulosis virus selected from the group consisting of Cydia pom one Ha 

55 GV (coddling moth GV), Pieris brassicae GV and Trichoplusia ni GV; (c) non-occluded virus selected 
from the group consisting of Orcytes rhinoceros NOV and Heliothis zea NOV; or (d) entomopox virus 
selected from the group consisting of Melolontha melonotha EPV, Amsacta moorei EPV, Locusta 
migratoria EPV, Melanoplus sanguinipes EPV, Schistocerca gregaria EPV, Aedes aegypti EPV and 
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Chironomus luridus EPV. 

6- The expression vector of Claim 5 wherein the nuclear polyhedrosis virus is Autographa californica 
MNPV. 

7. An expression vector which comprises the nucleic acid sequence of Claim 1 inserted into a baculovirus 
transfer vector. 

8. A host cell transformed with the nucleic acid sequence of Claim 1 . 

9. The host cell of Claim 8 wherein the host cell is an insect cell. 

10. A method of producing AalT which comprises transforming or infecting a host cell with the nucleic acid 
sequence of Claim 1, and culturing the host cell under conditions which permit expression of said 
nucleic acid sequence by the host cell. 

11. The method of Claim 10 in which the host cell is an insect cell. 

12. A method for protecting plants from damage from insects, which comprises delivering to said plant an 
insect virus containing the codon optimized nucleic acid sequence of SEQ ID NO: 3. 
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Not . AAG AAG AAT GGA TAT GCC CTC GAT ACT AGT GGT AAA GCT CCT GAA 

CO C...C...C...A C.TCA.TCC ..A C...C...G 

lys lys osn giy tyr olo vol osp ser ser gly lys olo pro glu 

Not. TGT CTT TTG AGC MT TAC TGT AAC AAC GAA TGC ACA AAA GTA CAT 

CO. . .C. . .G.C.C.TCG. . .C. . .T. . .C T C...6...6...C 

cys leu leu ser osn tyr cys osn osn glu cys thr lys vol his 

Not . TAT GCT GAC AAA GGA TAT TGC TGC TTA CTT TCA TGT TAT TGC 

CO. ..C G...C...C...T C.T...G...C...C 

tyr olo osp lys gly tyr cys cys leu leu ser cys tyr cys 

Not. TTC GGT CTA AAT GAC GAT AAA AAA GTT TTG GAG ATT TCG GAC ACA 

CO C...C C G C A...C...T...T...T 

phe gly leu osn osp osp lys lys vol leu glu He ser asp thr 

Not. AGG AAA AGT TAT TGT GAC ACC ACA ATA ATT AAT TAA 

CO. C.C...G...C...C C...C C... 

org lys ser tyr cys osp thr thr ile ile osn . . . 



CO. = Codon optimized 
Not. = Native 



FIG.1 
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